A phone-dependent confidence measure for utterance rejection
نویسندگان
چکیده
An acoustic con dence measure for acceptance/rejection of recognition hypotheses for continuous speech utterances is proposed. This measure is useful for rejecting utterances that are out of domain, or contain out-of-vocabulary words or speech dis uencies. A phone-based approach is implemented so that a single global threshold can be applied to hypothesis rejection for any word sequence. Phone con dence is computed for each frame of speech as the posterior phone probability given the acoustic observation. Word sequence con dence is evaluated as the average phone con dence, either by weighting all frames equally or by normalizing by phone duration. The con dence measure is tested on a database of spoken company names. When normalized by phone duration, it achieves, in some cases with less computational expense, rejection performance comparable to a baseline system implementing a common ller-model approach. When all frames are equally weighted, performance is substantially poorer.
منابع مشابه
Confidence measures derived from an acceptor HMM
In this paper we define a number of confidence measures derived from an acceptor HMM and evaluate their performance for the task of utterance verification using the North American Business News (NAB) and Broadcast News (BN) corpora. Results are presented for decodings made at both the word and phone level which show the relative profitability of rejection provided by the diverse set of confiden...
متن کاملUtterance verification/rejection for speaker-dependent and speaker-independent speech recognition
In this paper we discuss the differences of the utterance verification/rejection between speaker-dependent and speaker-independent speech recognition. A general guideline will be given to handle the verification task for the two type speech recognitions. We also discuss some specific methods have been commonly deployed in various speech recognition systems. In this study we describe two example...
متن کاملDetection of recognition errors based on classifiers trained on artificially created data
This paper wishes to contribute to the solution of the problem occurring when an automatic speech recognition system does not recognize an input utterance correctly. The solution is usually based on a utilization of a confidence measure (CM) which is assigned to each recognized word and which informs a user or a higher level module on the belief that the recognized word has been really said. Th...
متن کاملConfidence measures from local posterior probability estimates
In this paper we introduce a set of related confidence measures for large vocabulary continuous speech recognition (LVCSR) based on local phone posterior probability estimates output by an acceptor HMM acoustic model. In addition to their computational efficiency, these confidence measures are attractive as they may be applied at the state-, phone-, wordor utterance-levels, potentially enabling...
متن کاملUtterance Verification Using State-Level Log-Likelihood Ratio with Frame and State Selection
This paper suggests utterance verification system using state-level log-likelihood ratio with frame and state selection. We use hidden Markov models for speech recognition and utterance verification as acoustic models and anti-phone models. The hidden Markov models have three states and each state represents different characteristics of a phone. Thus we propose an algorithm to compute state-lev...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996